An Effective Approach to Biomedical Information Extraction with Limited Training Data

نویسنده

  • Siddhartha Jonnalagadda
چکیده

S i m p l i f i e d A b s t r a c t P I E P I E R e m o v e A n n o t a t i o n s O R C o m b i n a t i o n R e s u l t s f o r o r i g i n a l s e n t e n c e s R e s u l t s f o r S i m p l i f i e d s e n t e n c e s A I M e d C o m p a r i s o n o f d i f f e r e n t m e t h o d s W o r k F l o w f o r E a c h A b s t r a c t Thus, addressing the false negatives (case d) without increasing false positives (case b) would increase both the recall and precision of the system. Cases a) and c) do not affect the system’s performance. Case b presents a slight likelihood for a decrease in precision (where the simplified sentence

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Information Extraction from Biomedical Texts: Learning Models with Limited Supervision

Among the application domains of information extraction, the biomedical domain is one of the most important ones. This is due to the large amount of biomedical text sources including the vast scientific literature and collections of patient reports written in natural language. These sources contain a wealth of crucial knowledge that needs to be mined. Typical mining tasks regard entity recognit...

متن کامل

کاهش ابعاد داده‌های ابرطیفی به منظور افزایش جدایی‌پذیری کلاس‌ها و حفظ ساختار داده

Hyperspectral imaging with gathering hundreds spectral bands from the surface of the Earth allows us to separate materials with similar spectrum. Hyperspectral images can be used in many applications such as land chemical and physical parameter estimation, classification, target detection, unmixing, and so on. Among these applications, classification is especially interested. A hyperspectral im...

متن کامل

Learning to Extract Relations from MEDLINE

Information in text form remains a greatly underutilized resource in biomedical applications. We have begun a research effort aimed at learning routines for automatically mapping information from biomedical text sources, such as MEDLINE, into structured representations, such as knowledge bases. We describe our application, two learning methods that we have applied to this task, and our initial ...

متن کامل

A PCA/ICA based Fetal ECG Extraction from Mother Abdominal Recordings by Means of a Novel Data-driven Approach to Fetal ECG Quality Assessment

Background: Fetal electrocardiography is a developing field that provides valuable information on the fetal health during pregnancy. By early diagnosis and treatment of fetal heart problems, more survival chance is given to the infant.Objective: Here, we extract fetal ECG from maternal abdominal recordings and detect R-peaks in order to recognize fetal heart rate. On the next step, we find a be...

متن کامل

Maximum Entropy Learning Model for Biomedical Semantic Type Induction

In the biomedical literature the gene names tend to be used to refer to biomedical entities other than genes. Therefore, when performing information extraction tasks it is necessary to distinguish between all such biomedical entities. Our approach consists of training a Maximum Entropy classifier for this task using a large automatically-created training corpus, as opposed to manually annotated...

متن کامل

A New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model

Information extraction (IE) is a process of automatically providing a structured representation from an unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) which has been intensified by the increased volume of information and heterogeneity, and non-structured form of it. One of the core information extraction tasks is relation extraction wh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1107.5752  شماره 

صفحات  -

تاریخ انتشار 2011